Merge Join

The Join node is used to create joins between 2 tables, adding the join to the database schema. The Join node can be connected to any Select (excluding Multi Select), Preparation, and Column Operation nodes. The Join function combines columns (horizontally) from 2 tables, based on matching columns and according to the selected join type.

The Join node can be used to join tables from the datasource, or to join datasource tables with new tables generated by data cleansing and preparation functions. For example, both the Date Range and Summarize functions generate new tables; these can then be joined to the original table from the datasource. At other times, you may have tables showing similar information that might be more user friendly for data analysts if combined into one table.

Note: relationship 'joins' between tables can defined during the Data Model phase. These joins will be added at runtime during the query, rather than being stored in the database.

Create a Join

Connect the Join node to the 2 tables to be joined. With the Join node selected, go to the Properties panel to configure the join:

Resulting Table Name: name the new table.

Join Type: choose the join type (see below).

Join Columns: for each table, select the column by which to apply the join.

Join Types

Choose from the following join types:

  • Full Outer Join: returns all rows from both tables.
  • Inner Join: returns all rows from both tables based on a matching column. Only returns rows where the matching column's values are identical.
  • Left Outer Join: returns all rows from the first/ left table, and matching rows from the second/ right table.
  • Right Outer Join: returns all rows from the second/ right table, and matching rows from the first/ left table.
  • Cross Join: returns a result set by multiplying the number of rows in the first table by the number of rows in the second table. The result is a table that returns all possible combinations of all rows from both tables.

Examples